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Abstract 

We demonstrate that a mutant of uracil DNA glycosylase (N123D:L191A) distinguishes between cytosine and 
methylcytosine. Uracil DNA glycosylase (UDG) efficiently removes uracil from DNA in a reaction in which the base is 
flipped into the enzyme's active site. Uracil is selected over cytosine by a pattern of specific hydrogen bonds, and thymine is 
excluded by steric clash of its 5-methyl group with Y66. The N123D mutation generates an enzyme that excises cytosine. 
This N123D:L191 A mutant excises C when it is mispaired with A or opposite an abasic site, but not when it is paired with G. 
In contrast no cleavage is observed with any substrates that contain 5-methylcytosine. This enzyme may offer a new 
approach for discriminating between cytosine and 5-methylcytosine. 
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Introduction 

Uracil, which is generated by deamination of cytosine [1] 
producing G.U mispairs, is removed from DNA by uracil-DNA 
glycosylase (UDG) [2—4]. UDG is highly specific for uracil and 
shows no activity towards any other base [5]; the base pair partner 
of the U is not recognised and the enzyme also acts on A.U base 
pairs that arise through misincorporation during DNA replication 
[6] . Uracil is flipped out of the duplex into the enzyme's active site, 
followed by cleavage of the N-glycosidic bond [7-9]. This base 
flipping is aided by L191 that inserts into the DNA duplex [8], 
pushing out the uracil and increasing its lifetime in the active site 
[10]. The L191A mutant is less efficient at flipping out uracil [11], 
though the enzymatic activity can be rescued by pairing uracil 
with a bulky synthetic nucleoside that occupies the space of the 
base pair [10,12]. Thymine is excluded from the active site of 
UDG by steric clash between its 5-methyl group and Y66 [9,13]. 

UDG's remarkable specificity for uracil results from specific 
hydrogen bonding [14,15] and shape complementarity 
[11,12,16,17]. In particular N123 forms specific hydrogen bonds 
with 04 and N3 of uracil. Mutation of N 123 to aspartate (N123D) 
alters the hydrogen bond donor-acceptor pattern, allowing for 
recognition of cytosine thereby generating a cytosine DNA 
glycosylase (CDG) as shown in Figure 1A [17]. The double 
mutant (N123D:L191A, designated as CYDG), is unable to excise 
cytosine from a G.C base pair [18]. It has been reported that this 
enzyme only shows CDG activity when C is paired with a bulky 
group such as pyrene, which forces the cytosine into an 
extrahelical conformation [10,12,18]. This mutant still cleaves 
uracil at which it is at least 1000-fold less active than UDG. We 
reasoned that CDG should be able to discriminate between 



cytosine and 5-methylcytosine ( e C) by the same mechanism that 
UDG discriminates between U and T (Figure IB). 

Cytosine methylation, especially at CpG sites, acts as an 
epigenetic marker which affects gene expression and regulation. 
The most commonly used methods for detecting 5-methylcytosine 
are direct sequencing after treatment with bisulphite [19] or 
protection from cleavage by methylation sensitive restriction 
enzymes [20,21]. We have therefore explored whether CYDG 
can discriminate between C and Me C, in the same way that UDG 
discriminates between U and T (Figure 1). We have determined 
the cleavage selectivity of CYDG and show that it can remove 
cytosine, but not methylcytosine, when it is mispaired or opposite 
an abasic site. 

Materials and Methods 

Preparation of Enzymes 

The sequence of Escherichia coli UDG was cloned between the 
EcoRI and Hindlll sites of pUC18. Site-directed mutagenesis 
generated the L191A mutation, which was followed by the N123D 
mutation. The sequence was then subcloned into pET28a and 
inserted between the Ndel and EcoRI sites. The enzyme (CYDG) 
was expressed in BL21(DE3)pLysS cells, which were induced with 
0.2 mM IPTG for three hours. The cells were lysed by sonication, 
purified using a Ni-NTA (His Trap FF Crude; GE Healthcare) 
and eluted in 250 mM imidazole. The enzyme was concentrated 
and further purified using a 20 mL 10000 MW Vivaspin column 
(Fisher Scientific). This produced CYDG that was about 95% 
pure, as estimated by SDS polyacrylamide gel electrophoresis, 
with a yield of 1 .5 mg per litre culture. 
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Figure 1. Interaction of UDG and CYDG with U, T, C and Me C. A) Interaction of U with N123 in uracil DNA glycosylase and proposed 
recognition of C by D123 in the N123D mutant. B) Exclusion of T and Me C caused by steric clash between their 5-methyl groups and Y66 (circled). 
doi:1 0.1 371 /journal.pone.0095394.g001 



Preparation of Oligonucleotides 

Oligonucleotides were synthesized on an Applied Biosystems 
ABI 394 automated DNA/RNA synthesizer on the 0.2 or 1 |J,mol 
scale using standard methods. Phosphoramidite monomers and 
other reagents were purchased from Applied Biosystems or Link 
Technologies. The pyrrolidine anthraquinone phosphoramidite 
was purchased from Berry & Associates. Each 31 mer oligonucle- 
otide was radiolabelled at its 5'-end with y- :i2 P[ATP] using T4 
polynucleotide kinase (New England Biolabs), purified by dena- 
turing PAGE, and resuspended in 10 mM MES pH 6.3 contain- 
ing 25 mM NaCl and 2.5 mM MgCl 2 . These were mixed with an 
excess of the unlabelled complementary oligonucleotides and 
annealed by slowly cooling from 95°C to 4°C. 

Enzyme Cleavage 

Radiolabelled DNA (approximately 50 nM) was incubated with 
CYDG (typically 1.25 |iM) for up to 24 h, removing samples at 
various time intervals. The reaction was stopped using 10% 
piperidine (v/v) and heated at 95°C for 20 min to cleave the 
phosphodiester backbone. The samples were lyophilised, resus- 
pended in 5 |jE loading buffer (80% (v/v) formamide, 10 mM 
EDTA, 10 mM NaOH and 0.1% (w/v) bromophenol blue) and 
run on a 12.5% denaturing polyacrylamide gel containing 8 M 
urea. The gel was then fixed, dried, subjected to phosphorimaging 
and analysed using ImageQuantTL. Experiments were performed 
in triplicate; k cat values were determined from plots of percentage 
cleaved against time, using SigmaPlot, by fitting each set of data to 
a single exponential rise to maximum. These were then averaged 
and the rate constants are reported with ± standard deviation. 
The rate of cleavage of some substrates was very low (less than 
10% cleaved after 24 hours incubation). In these instances an 
estimate of the rate constant was obtained from the fraction 
cleaved at a given time, assuming a simple exponential process. 



Results 

Generation of CYDG (N123D, L191A) 

Initial attempts to prepare the N123D mutant of E. coli UDG, 
which should have CDG activity, were unsuccessful, confirming 
that this enzyme is cytotoxic in E. coli [17,18]. The L191A mutant 
was therefore first introduced into UDG (generating UYDG [7]), 
which was followed by the second N123D mutation to produce 
CYDG. The mutations were generated in pUC18 and then 
subcloned into pET28a followed by expression of the protein in E. 
coli. 

Excision Properties of CYDG 

The activity and specificity of CYDG were tested against a 
range of double and single stranded DNA templates. Synthetic 
31 mer oligonucleotide substrates were designed so as to pair U, 
T, C or M '*C with G, A, AP (abasic site), Z (anthraquinone 
pyrrolidine) or a gap using two 15 mer oligonucleotides (Table 1). 
Previous studies have used a pyrene nucleoside [10,12,18] as a 
plug to force the base into the active site; we used anthraquinone 
pyrrolidine as a similar bulky nucleotide analogue. The results, 
after incubating all the substrates with an excess of the enzyme, are 
shown in Figure 2. Most importantly CYDG shows no activity 
against all the sequences that contain a central methylcytosine, 
confirming that the 5-methyl group of cytosine is excluded from 
the active site in a similar fashion to exclusion of the 5-methyl 
group of T by UDG. In contrast all the sequences with a central 
cytosine are cleaved, except when this is paired with guanine. 

As expected, cleavage is observed when C is place opposite the 
bulky anthraquinone analogue, as previously observed with a 
pyrene nucleotide [18]. More surprisingly, cleavage is also 
observed when C is placed opposite an A, an abasic site or a 
gap, though there is no reaction with a G.C base pair. CYDG has 
residual activity against uracil, even when this is positioned 
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5 ' -GGCTAAATACCGACTXTGCGCACTGATTCGG-3 ' 
3 ' -CCGATTTATGGCTGAQACGCGTGACTAAGCC-5 ' 



Sppp-H.HUs p- p- (J I O s pH& 
UO<<OnOO<<NN<<mm 




Figure 2. CYDG cleavage of 31 mer DNA fragments. The DNA 

fragments contained a central U, T, C or Me C (X) opposite different bases 
(Q). The duplex substrates (—50 nM), which had been labelled with 32 P 
at the 5 '-end of the upper strand, were incubated with —1.25 uM CYDG 
for 24 hours and then cleaved by heating at 95 C C in 1 0% piperidine. The 
products were resolved on a 12.5% denaturing polyacrylamide gel. 
doi:1 0.1 371 /journal.pone.0095394.g002 

opposite adenine, but showed no activity towards thymine in any 
base pair combination. 

Determination of k cat 

In order to assess the best base pair combination for 
discriminating between C and Me C we examined the kinetics of 
cleavage of C by CYDG when it is placed opposite various bases. 
Representative cleavage profiles are shown in Figures 3 and 4 and 
the data are summarised in Table 2. Reaction with the substrate 
containing a single A.C mismatch produced a single product at a 
rate of 0.006 ±0.001 min - . The presence of a single product 
confirms that the enzyme does not cleave C when paired with G 
since this fragment contains several G.C base pairs. CYDG still 
cleaves at A.U and G.U, as previously reported [18], though this is 
much slower than native UDG at these sites. The excision of uracil 
from G.U (0.36±0.04 min - ) is approximately 60-fold faster, but 



the observation that cleavage at A.U (0.020±0.04 min ) is about 
20-fold slower than G.U suggests that the enzyme is best able to 
cleave C or U when they are in unstable (non-W atson-Crick) base 
pair combinations. Anthraquinone pyrrolidine was included 
opposite C so as to force the target base into an extrahelical 
conformation. This produced the fastest cleavage rate at C 
(0.10±0.02 min '), faster even than A.U, though not as fast as at 
G.U; again no reaction is observed at Z. Mc C. These results suggest 
that base pair stability plays a major role in determining the rate of 
cleavage. This is further confirmed by experiments with the 
sequence in which the A.C mismatch is flanked by G.C base pairs 
[A.C(G)] for which cleavage is reduced by about 100-fold 
compared to A.C flanked by A.T base pairs. Fast cleavage was 
also achieved with gap.C (0.016±0.002 min ), which contains a 
gap opposite the C residue, allowing the unpaired cytosine to enter 
the active site of CYDG more easily. However, only 50% of this 
substrate was cleaved (Figure 3), while all other substrates were 
completely digested. This difference is probably due to the lower 
T m of the duplexes formed with these split oligos, which is close to 
the reaction temperature. We therefore examined cleavage of an 
extended DNA substrate that contained an additional five base 
pairs on either side of the central C (long gap.C) (Figure 4). The 
extent of cleavage was improved to 80% with this longer substrate, 
though the reaction proceeded at a slightly slower rate. The lower 
cleavage efficiency may also be because CYDG binds with high 
affinity to the gap on the opposite strand, consistent with the 
observation that UDG has high affinity for AP sites protecting 
them from further mutagenesis during base excision repair [16]. 

We also examined the ability of CYDG to cleave Cs in single 
stranded DNA substrates (Figure 4). We used two substrates 
containing a single cytosine for these experiments; ssC(polyA) 
contains a single C residue within a polydA tract, while ssC(GAT) 
contains a single C within a mixed sequence of G, A and T. 
Although UDG cuts single-stranded Us faster than those paired 
with A or G [22], we observed only very slow cleavage of both 
single-stranded DNAs by CYDG. 

Discussion 

These results show that CYDG, derived from E. coli UDG, is 
able to discriminate between cytosine and 5-methylcytosine; no 
activity against Me C was detected in any of the substrates tested, 
while C is efficiendy cleaved, except when it is paired with G. In 
UDG Y66 is positioned close to the 5 position of the pyrimidine 



Table 1. DNA oligonucleotides used in this study to characterise the cleavage rates of CYDG. 





Substrate 


Sequence 


G.U, A.U, A.C, AP.C, Z.C 


5'-CCGAATCAGTGCGCAXAGTCGGTATTTAGCC 
3'-GGCTTAGTCACGCGTQTCAGCCATAAATCGG 


A.C(G) 


5'-CCGAATCAGTGCGCGCGGTCGGTATTTAGCC 
3'-GGCTTAGTCACGCGCACCAGCCATAAATCGG 


G.C 


5 '-CGAATA ATTATATAAC ATATATATATTTAGC 
3 ' -GCTTATTA ATATATTGTATATATATA A ATCG 


gap.C 


5'-CCGAATCAGTGCGCACAGTCGGTATTTAGCC 
3 '-GGCTTAGTCACGCGT TCAGCCATAAATCGG 


Long gap.C 


5'-CCGTACTGAATCAGTGCGCACAGTCGGTATTTACGATAGCC 
3'-GGCATGACTTAGTCACGCGT TCAGCCATAAATGCTATCGG 


ssC(polyA) 


5'-AAAAAAAAAAAAAAACAAAAAAAAAAAAAAA 


ssC(GAT) 


5'-GGATAAATAGGGAGTCTGAGAAGTGATTAGG 



The target bases are shown in bold and underlined; where X = U or C and Q = G, A, AP (abasic site) or Z (anthraquinone pyrrolidine). 
doi:1 0.1 371 /joumal.pone.0095394.t001 
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Figure 3. CYDG cleavage of fragments containing a central U or C opposite different bases. In each gel the 32 P labelled duplex substrates 
(—50 nM) were incubated with 1.25 uM CYDG for up to 24 hours and cleaved by boiling in 10% piperidine. The products were resolved on 12.5% 
denaturing polyacrylamide gels. The graphs are derived from phosphorimage analysis of the gels and show the rate of formation of the cleavage 
product. These are fitted with single exponential curves. 
doi:1 0.1 371 /journal.pone.0095394.g003 



base and the 5-methyl group is sterically excluded. Alteration of 
the hydrogen bonding pattern at N123 changes the base 
selectivity, but the mutant enzyme is still able to discriminate 
between pyrimidine and 5-methylpyrimidine. The lack of activity 
of CYDG against G.C base pairs therefore suggests the possibility 
of using this enzyme to probe the methylation status of a specific 
cytosine, by mispairing it with another base such as adenine. 

CYDG cleaves cytosine when it is unpaired or mispaired, and 
the stability of the base pair determines the rate of cleavage 
[23,24]. CYDG excised cytosine from Z.C faster than uracil from 
A.U, presumably because the mispaired cytosine is more easily 
forced into an extrahelical configuration than uracil in the 
Watson-Crick A.U pair. The faster cleavage of gap.C and AP.C 
occurs because there is no base opposite the C. If G.C base pairs 
flank the target cytosine then the rate of cleavage at A.C is 
dramatically reduced as a result of the increased local DNA 



stability [25] and the inability of CYDG to flip the base into the 
active site [10-12]. CYDG retains uracil DNA glycosylase activity 
despite the N123D mutation since free rotation of the aspartate 
side chain can still present the correct hydrogen bonding pattern 
for interacting with U [26]. Although the activity of CYDG is 
greatly reduced compared with wild type UDG, its catalytic 
activity is similar to that of many other DNA glycosylases [27-30]. 

The ability of CYDG to excise uracil from A.U but not cytosine 
from G.C suggests that this activity is dependent on the stability of 
the base pair and that the base can move into the enzyme's active 
site when it is not involved in a stable base pair. The major role of 
LI 91 therefore seems to be to plug the space left after base 
flipping, rather than to actively assist the mechanism of base 
flipping itself [11]. The binding of CYDG to the duplex and the 
distortion it causes to the DNA [1 1,16,31] appears to be sufficient 
to destabilise an A.U but not G.C base pairs. 
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Figure 4. Kinetics of CYDG cleavage of fragments containing a central C. In each gel the P labelled duplex substrates (—50 nM) were 
incubated with 1.25 uM CYDG for up to 24 hours and cleaved by boiling in 10% piperidine. The products were resolved on 12.5% denaturing 
polyacrylamide gels. The graph for long gap.C was derived from phosphorimage analysis of the gel and shows the rate of formation of the cleavage 
product; this is fitted with a single exponential curve. 
doi:1 0.1 371 /journal.pone.0095394.g004 



In summary we have shown that CYDG is able to discriminate 
between cytosine and 5-methylcytosine. Gytosine-DNA glycosy- 
lase activity is observed when C is unpaired or in an unstable (non 



Watson-Crick) base pair, while no activity is observed at e C in 
any base pair combination. This enzyme may offer an approach 
for discriminating between cytosine and 5-methylcytosine, in 



Table 2. k cat values for CYDG cleavage of the different DNA substrates. 



Substrate 


kca, (min 


Rel 


A.C 


0.006±0.001 


1.7 


A.C(G) 1 


0.0001 


-0.02 


AP.C 


0.014±0.003 


4.0 


Z.C 


0.10±0.02 


29 


G.C 


ND 


< 0.001 


gap.C 2 


0.016±0.002 


4.6 


Long gap.C 


0.0072±0.0007 


2.0 


G.U 


0.36 ±0.04 


100 


A.U 


0.020±0.004 


5.6 


ssC(polyA) 1 


0.0003 ±0.0001 


-0.07 


ssC(GAT) 1 


0.0001 


-0.02 



The sequences of the oligonucleotides are shown in Table 1 . No cleavage was observed for any substrate containing methylcytosine. ND - no cleavage detected after 24 

hours. Values represent the average of three independent determinations ± standard deviations. 

1 k cat values were estimated from single time points at 24 hrs A.C(G), 60 mins ssC(polyA) and 4 hrs ssC(GAT). 

2 Only 50% of the substrate was cleaved for gap.C. Rel indicates the cleavage rate relative to that of G.U (100). 

doi:1 0.1 371 /joumal.pone.0095394.t002 
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which the methylation status of a specific cytosine is probed by 
annealing it with an oligonucleotide that generates a mismatch, 
such as AC. A cytosine at this position will be cleaved while 
methylcytosine will not; PCR amplification of the reaction 
products can then be used to discriminate between the cleaved 
and uncleaved species. 
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